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WHAT IS CLAIMED IS : 

1 . A method for facilitating translation of an audio signal that includes speech to 
another language, comprising: 

retrieving a textual representation of the audio signal; 
presenting the textual representation to a user; 

receiving selection of a segment of the textual representation for translation; 
obtaining a portion of the audio signal corresponding to the segment of the textual 
representation; 

providing the segment of the textual representation and the portion of the audio signal to 
the user; and 

receiving translation of the portion of the audio signal from the user. 

2. The method of claim 1, wherein the retrieving a textual representation includes: 
generating a request for information, 

sending the request to a server, and 

obtaining, from the server, at least the textual representation of the audio signal. 

3. The method of claim 1, wherein the presenting the textual representation to a user, 
includes: 

obtaining the audio signal, 

providing the audio signal and the textual representation of the audio signal to the user, 

and 

21 



PATENT 
Docket No. 02-4038 

visually synchronizing the providing of the audio signal with the textual representation of 
the audio signal. 

4. The method of claim 3, wherein the obtaining the audio signal includes: 
accessing a database of original media to retrieve the audio signal. 

5. The method of claim 3, wherein the obtaining the audio signal includes: 
receiving input, from the user, regarding a desire for the audio signal, 
initiating a media player, and 

using the media player to obtain the audio signal. 

6. The method of claim 1, wherein the receiving selection of a segment of the textual 
representation includes: 

identifying a portion of the textual representation selected by the user, 

accessing a server to obtain text corresponding to the portion of the textual representation, 

and 

receiving, from the server, the text corresponding to the portion of the textual 
representation. 

7. The method of claim 6, wherein the text includes a transcription of the audio 
signal and metadata corresponding to the portion of the textual representation. 
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8. The method of claim 1, wherein the obtaining a portion of the audio signal 
includes: 

initiating a media player, and 

using the media player to obtain the portion of the audio signal. 

9. The method of claim 8, wherein the using the media player includes: 
identifying, by the media player, the segment of the textual representation, and 
retrieving the portion of the audio signal corresponding to the segment of the textual 

representation. 

10. The method of claim 9, wherein the identifying the segment includes: 
identifying time codes associated with a beginning and an ending of the segment of the 

textual representation. 

11. The method of claim 9, wherein the segment of the textual representation includes 
a starting position in the textual representation; and 

wherein the identifying the segment includes: 

identifying a time code associated with the starting position in the textual representation. 

12. The method of claim 1, wherein the providing the segment of the textual 
representation and the portion of the audio signal to the user includes: 
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displaying the segment of the textual representation in a same window as will be used by 
the user to provide the translation of the portion of the audio signal. 

13. The method of claim 1, wherein the providing the segment of the textual 
representation and the portion of the audio signal to the user includes: 

visually synchronizing the providing of the portion of the audio signal with the segment 
of the textual representation. 

14. The method of claim 13, wherein the segment of the textual representation 
includes time codes corresponding to when words in the textual representation were spoken. 

15. The method of claim 14, wherein the visually synchronizing the providing of the 
portion of the audio signal with the segment of the textual representation includes: 

comparing times corresponding to the providing of the portion of the audio signal to the 
time codes from the segment of the textual representation, and 

visually distinguishing words in the segment of the textual representation when the words 
are spoken during the providing of the portion of the audio signal. 

16. The method of claim 1, wherein the providing the segment of the textual 
representation and the portion of the audio signal to the user includes: 

permitting the user to control the providing of the portion of the audio signal. 
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17. The method of claim 16, wherein the permitting the user to control the providing 
of the portion of the audio signal includes: 

allowing the user to at least one of fast forward, speed up, slow down, and back up the 
providing of the portion of the audio signal using foot pedals. 

18. The method of claim 16, wherein the permitting the user to control the providing 
of the portion of the audio signal includes: 

permitting the user to rewind the portion of the audio signal at least one of a 
predetermined amount of time and a predetermined number of words. 

19. The method of claim 1, further comprising: 
publishing the translation to a user-determined location. 

20. A system for facilitating translation of speech between languages, comprising: 
means for obtaining a textual representation of the speech in a first language; 
means for presenting the textual representation to a user; 

means for receiving selection of a portion of the textual representation for translation; 

means for retrieving an audio signal in the first language that corresponds to the portion 
of the textual representation; 

means for providing the portion of the textual representation and the audio signal to the 
user; and 

means for receiving translation of the audio signal into a second language. 
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21. A translation system, comprising: 

a memory configured to store instructions; and 

a processor configured to execute the instructions in memory to: 

obtain a transcription of an audio signal that includes speech, 

present the transcription to a user, 

receive selection of a portion of the transcription for translation, 
retrieve a portion of the audio signal corresponding to the portion of the 
transcription, 

provide the portion of the transcription and the portion of the audio signal to the 
user, and 

receive translation of the portion of the audio signal from the user. 

22. The system of claim 21 , wherein when obtaining a transcription, the processor is 
configured to: 

generate a request for information, 
send the request to a server, and 

obtain, from the server, at least the transcription of the audio signal. 

23. The system of claim 21, wherein when presenting the transcription to a user, the 
processor is configured to: 

obtain the audio signal, 
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provide the audio signal and the transcription of the audio signal to the user, and 
visually synchronize the providing of the audio signal with the transcription of the audio 

signal. 

24. The system of claim 23, wherein when obtaining the audio signal, the processor is 
configured to: 

access a database of original media to retrieve the audio signal. 

25. The system of claim 23, wherein when obtaining the audio signal, the processor is 
configured to: 

receive input, from the user, regarding a desire for the audio signal, 

initiate a media player, and 

use the media player to obtain the audio signal. 

26. The system of claim 21, wherein when receiving selection of a portion of the 
transcription, the processor is configured to: 

identify a range of the transcription selected by the user, 

access a server to obtain text corresponding to the range of the transcription, and 
receive, from the server, the text corresponding to the range of the transcription. 



27. The system of claim 26, wherein the text includes metadata corresponding to the 
range of the transcription. 
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28. The system of claim 21, wherein when retrieving a portion of the audio signal, the 
processor is configured to: 

initiate a media player, and 

use the media player to obtain the portion of the audio signal. 

29. The system of claim 28, wherein the media player is configured to: 
identify the portion of the transcription, and 

retrieve the portion of the audio signal corresponding to the portion of the transcription. 

30. The system of claim 29, wherein when identifying the portion, the media player is 
configured to: 

identify time codes associated with a beginning and an ending of the portion of the 
transcription. 

3 1 . The system of claim 29, wherein the portion of the transcription includes a 
starting position in the transcription; and 

wherein when identifying the portion, the media player is configured to: 
identify a time code associated with the starting position in the transcription. 

32. The system of claim 21, wherein when providing the portion of the transcription 
and the portion of the audio signal to the user, the processor is configured to: 
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present a split screen in a translation window, the translation window including a 
translation section and a transcription section, and 

display the portion of the transcription in the transcription section. 

33. The system of claim 21, wherein when providing the portion of the transcription 
and the portion of the audio signal to the user, the processor is configured to: 

visually synchronize the providing of the portion of the audio signal with the portion of 
the transcription. 

34. The system of claim 33, wherein the portion of the transcription includes time 
codes corresponding to when words in the transcription were spoken. 

35. The system of claim 34, wherein when visually synchronizing the providing of the 
portion of the audio signal with the portion of the transcription, the processor is configured to: 

compare times corresponding to the providing of the portion of the audio signal to the 
time codes from the portion of the transcription, and 

visually distinguish words in the portion of the transcription when the words are spoken 
during the providing of the portion of the audio signal. 

36. The system of claim 21, wherein when providing the portion of the transcription 
and the portion of the audio signal to the user, the processor is configured to: 

permit the user to control the providing of the portion of the audio signal. 
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37. The system of claim 36, further comprising: 

foot pedals configured to aid the user to at least one of fast forward, speed up, slow down, 
and back up the providing of the portion of the audio signal. 

38. The system of claim 36, wherein when permitting the user to control the providing 
of the portion of the audio signal, the processor is configured to: 

permit the user to rewind the portion of the audio signal at least one of a predetermined 
amount of time and a predetermined number of words. 

39. The system of claim 21, wherein the processor is further configured to: 
publish the translation to a user-determined location. 

40. A graphical user interface, comprising: 

a transcription section that includes a transcription of non-text information in a first 
language; 

a translation section that receives a translation of the non-text information into a second 
language; and 

a play button that, when selected, causes: 

retrieval of the non-text information to be initiated, 
playing of the non-text information, and 



30 



PATENT 
Docket No. 02-4038 

the playing of the non-text information to be visually synchronized with the 
transcription in the transcription section. 

41. The graphical user interface of claim 40, wherein the transcription visually 
distinguishes names of people, places, and organizations. 

42. The graphical user interface of claim 40, further comprising: 

a configuration button, that when selected, causes a window to be presented, the window 
permitting an amount of backup to be specified, the amount of backup including one of a 
predetermined amount of time and a predetermined number of words. 

43. The graphical user interface of claim 42, wherein the window further permits a 
name to be given for the translation and a location of publication to be specified. 

44. The graphical user interface of claim 40, wherein the play button further causes 
words in the transcription to be visually distinguished in synchronism with the words in the non- 
text information being played. 

45. The graphical user interface of claim 40, wherein the non-text information 
includes at least one of audio and video. 
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46. The graphical user interface of claim 40, wherein the graphical user interface is 
associated with a word processing application. 
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